WBTK: a New Set of Microbenchmarks to Explore Memory System Performance for Scientific Computing

نویسندگان

  • William Jalby
  • Christophe Lemuet
  • X. Le Pasteur
چکیده

Memory hierarchies are a key component in obtaining high performance on modern microprocessors. To satisfy the ever-increasing demand on data rate access, they are also becoming increasingly complex: multilevel caches, non-blocking caches, sophisticated instructions for supporting prefetch and cache control, etc. If all of these advanced features promise to offer large performance gains, they also generate in some cases performance “anomalies” (i.e. bad performance triggered by specific code patterns). For precisely locating and understanding these anomalies, a new set of microbenchmarks called WBTK is introduced. We show through systematic experimentation on Alpha 21264, Power4 and Itanium1 that this microbenchmark first allowed us to detect most of the anomalies encountered on simple BLAS1 type codes. Secondly, it led us to demonstrate that vectorization of memory access was an efficient workaround for most of these anomalies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Intel's Haswell Microarchitecture Using the ECM Model and Microbenchmarks

This paper presents an in-depth analysis of Intel’s Haswell microarchitecture for streaming loop kernels. Among the new features examined is the dual-ring Uncore design, Cluster-on-Die mode, Uncore Frequency Scaling, core improvements as new and improved execution units, as well as improvements throughout the memory hierarchy. The Execution-Cache-Memory diagnostic performance model is used toge...

متن کامل

Parallel computing using MPI and OpenMP on self-configured platform, UMZHPC.

Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...

متن کامل

The Effects of Memory-Rich Environments on File System Microbenchmarks

File system performance has been greatly influenced by disk caching mechanisms. As the size of memory increases, common workloads are more likely to run completely from memory, and the effects of L2 caching and underlying hardware are becoming more visible. This paper investigates performance anomalies observed when measuring and comparing the memory performance of various leading file systems....

متن کامل

Operating System Enhancements for Data-Intensive Server Systems

Recent studies on operating system support for concurrent server systems mostly target CPU-intensive workloads with light disk I/O activities. However, an important class of server systems that access a large amount of disk-resident data, such as the index searching server of large-scale Web search engines, has received limited attention. In this thesis work, we examine operating system techniq...

متن کامل

Intel Woodcrest: An Evaluation for Scientific Computing

Intel recently began shipping its Xeon 5100 series processors, formerly known by their “Woodcrest” code name. To evaluate the suitability of the Woodcrest processor for high-end scientific computing, we obtained access to a Woodcrest-based system at Intel and measured its performance first using computation and memory microbenchmarks, followed by full applications from the areas of climate mode...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJHPCA

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2004